Tom Goldstein

Multi-Token Prediction via Self-Distillation

Feb 05, 2026
Via arXiv

Antidistillation Fingerprinting

Feb 03, 2026
Via arXiv

Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence

Nov 10, 2025
Via arXiv

RL Is a Hammer and LLMs Are Nails: A Simple Reinforcement Learning Recipe for Strong Prompt Injection

Oct 06, 2025
Via arXiv

Zebra-CoT: A Dataset for Interleaved Vision Language Reasoning

Jul 22, 2025
Via arXiv

Speedy Deformable 3D Gaussian Splatting: Fast Rendering and Compression of Dynamic Scenes

Jun 09, 2025
Via arXiv

ARGUS: Hallucination and Omission Evaluation in Video-LLMs

Jun 09, 2025
Via arXiv

A Fictional Q&A Dataset for Studying Memorization and Knowledge Acquisition

Jun 05, 2025
Via arXiv

MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal Reasoning

Jun 05, 2025
Via arXiv

The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Jun 05, 2025
Via arXiv